Building Patterns for Biomedical Event Extraction
نویسندگان
چکیده
Generally, Event Extraction is to identify any instance of a particular class of events in a natural language text, to extract the relevant arguments of the event, and to represent the extracted information into a structured form.1Let us define Event on the binary relation between two entities for special event verbs which are predefined by biologists. Here, Entity means biomedical entities such as proteins, genes, cells, tissues, etc. According to the definition of event, our event extraction system considers only such sentences which contain at least one event verb and two entities. The training consists of two procedures (Figure 1). First, the preprocessor involves chunking, named entity tagging, dependency relation tagging and sentence normalization with special items for building patterns. Special items are entities, event verbs, non-event verbs, prepositions, relatives, conjunctions and symbols. Second, all possible candidate events are extracted from the training corpus and the corresponding patterns are also generated. At this time, we utilize the following assumptions: one event can be described by one or more patterns in the whole documents and one pattern also can be generated by one or more events. Therefore, the event and the pattern information has reciprocal relation. We use the event score (Equation 1) to measure the reliability of extracted events and the pattern score (Equation 2) to measure the reliability of extracted patterns. The scores are iteratively updated in a co-updating method. Updating the event score causes reranking of candidate events and the iteration is continued until the ranking of events is no longer changed. The result of the training is a set of generated patterns and their scores. The events in training corpus are also extracted as the by-product of the training.
منابع مشابه
A fast rule-based approach for biomedical event extraction
In this paper we present a biomedical event extraction system for the BioNLP 2013 event extraction task. Our system consists of two phases. In the learning phase, a dictionary and patterns are generated automatically from annotated events. In the extraction phase, the dictionary and obtained patterns are applied to extract events from input text. When evaluated on the GENIA event extraction tas...
متن کاملA Pattern Approach for Biomedical Event Annotation
We describe our approach for the GENIA Event Extraction in the Main Task of BioNLP Shared Task 2011. There are two important parts in our method: Event Trigger Annotation and Event Extraction. We use rules and dictionary to annotate event triggers. Event extraction is based on patterns created from dependent graphs. We apply UIMA Framework to support all stages in our system.
متن کاملUnsupervised Event Extraction from Biomedical Text Based on Event and Pattern Information
In this paper, we proposed a new event extraction method from biomedical texts. It can extend patterns by unsupervised way based on event and pattern information. Evaluation of our system on GENIA corpus achieves 90.1% precision and 70.0% recall.
متن کاملInter-Event Dependencies support Event Extraction from Biomedical Literature
The description of events in biomedical literature often follows discourse patterns. For example, authors may firstly mention the transcription of a gene, and then go on to describe how this transcription is regulated by another gene. Capturing such patterns can be beneficial when we want to extract event mentions from literature. For instance, detecting the mention of a transcription of gene A...
متن کاملA Hybrid approach for biomedical event extraction
In this paper we propose a system which uses hybrid methods that combine both rule-based and machine learning (ML)-based approaches to solve GENIA Event Extraction of BioNLP Shared Task 2013. We apply UIMA1 Framework to support coding. There are three main stages in model: Pre-processing, trigger detection and biomedical event detection. We use dictionary and support vector machine classifier t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004